Ranking Information in Networks

نویسندگان

  • Tina Eliassi-Rad
  • Keith Henderson
چکیده

Given an information network, we are interested in ranking sets of nodes that score highest on user-specified criteria. Examples include (1) discovering sets of authors with expertise in a wide range of disciplines; (2) finding sets of patents which, if removed, would have the greatest effect on the patent citation network; (3) identifying small sets of IP addresses which taken together account for a significant portion of a typical day’s traffic; and (4) detecting sets of countries whose vote agree with a given country (e.g., USA) on a wide range of UN resolutions. We present this ranking task as a Top-K problem. We assume the criteria is defined by L real-valued features on nodes. This gives us a matrix F , whose rows represent the N nodes and whose columns represent the L real-valued features. Examples of these node-centric features include degree, betweenness, PageRank, etc. We define an L-tuple �v1, . . . , vL� as a selection of one value from each column. The score of an L-tuple is equal to the sum of the selected values (i.e., sum of the tuple’s components). For a given parameter K, we require an algorithm that efficiently lists the top K L-tuples ordered from best (highest score) to worst. The L-tuple with the highest score corresponds to a set of nodes in which all features take on optimal values. As described in the next section, we solve the Top-K problem by utilizing SMA* [1], which is a fixedmemory heuristic search algorithm. SMA* requires the specification of an additional parameter M for the maximum allotted memory-size. The choice for M has a dramatic effect on runtime. To solve this inefficiency problem, we introduce a parallelization of SMA* (on distributed-memory machines) that increases the effective memory size and produces super-linear speedups. This allows use to efficiently solve the Top-K ranking problem for large L and K (e.g., L ∈ [102, 103] and K ∈ [106, 109]). Experiments on synthetic and real data illustrate the effectiveness of our solution.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating the Impact of Authors’ Rank in Bibliographic Networks on Expertise Retrieval

Background and Aim: this research investigates the impact of authors’ rank in Bibliographic networks on document-centered model of Expertise Retrieval. Its purpose is to find out what kind of authors’ ranking in bibliographic networks can improve the performance of document-centered model.   Methodology: Current research is an experimental one. To operationalize research goals, a new test colle...

متن کامل

Ranking Methods for Networks

Glossary Ranking: sort objects according to some order. Query-dependent ranking: objects are assigned with different ranks according to different queries. Proximity ranking: objects are ranked according to proximity or similarity to other objects. Homogeneous information network: networks that contain one type of objects and one type of relationships. Heterogeneous information network: networks...

متن کامل

Design of cybernetic metamodel of cryptographic algorithms and ranking of its supporting components using ELECTRE III method

Nowadays, achieving desirable and stable security in networks with national and organizational scope and even in sensitive information systems, should be based on a systematic and comprehensive method and should be done step by step. Cryptography is the most important mechanism for securing information. a cryptographic system consists of three main components: cryptographic algorithms, cryptogr...

متن کامل

Finding Influential Institutions in Bibliographic Information Networks

Ranking in bibliographic information networks is a widely studied problem due to its many applications such as advertisement industry, funding, search engines, etc. Most of the existing works on ranking in bibliographic information network are based on ranking of research papers and their authors. But the bibliographic information network can be used for solving other important problems as well...

متن کامل

Clustering and Ranking in Heterogeneous Information Networks via Gamma-Poisson Model

Clustering and ranking have been successfully applied independently to homogeneous information networks, containing only one type of objects. However, real-world information networks are oftentimes heterogeneous, containing multiple types of objects and links. Recent research has shown that clustering and ranking can actually mutually enhance each other, and several techniques have been develop...

متن کامل

Clustering and Ranking in Heterogeneous Information Networks via Gamma-Poisson Model | Proceedings of the 2015 SIAM International Conference on Data Mining | Society for Industrial and Applied Mathematics

Clustering and ranking have been successfully applied independently to homogeneous information networks, containing only one type of objects. However, real-world information networks are oftentimes heterogeneous, containing multiple types of objects and links. Recent research has shown that clustering and ranking can actually mutually enhance each other, and several techniques have been develop...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011